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Abstract. The voter model and the Axelrod model are two of the main stochastic 
processes that describe the spread of opinions on networks. The former includes 
social influence, the tendency of individuals to become more similar when they in- 
teract, while the latter also accounts for homophily, the tendency to interact more 
frequently with individuals which are more similar. The Axelrod model has been 
extensively studied during the past ten years based on numerical simulations. In 
contrast, we give rigorous analytical results for a generalization of the voter model 
that is closely related to the Axelrod model as it combines social influence and 
confidence threshold, which is modeled somewhat similarly to homophily. Each 
vertex of the network, represented by a finite connected graph, is characterized 
by an opinion and may interact with its adjacent vertices. Like the voter model, 
an interaction results in an agreement between both interacting vertices - social 
influence - but unlike the voter model, an interaction takes place if and only if the 
vertices' opinions are within a certain distance - confidence threshold. In a deter- 
ministic static approach, we first give lower and upper bounds for the maximum 
number of opinions that can be supported by the network as a function of the con- 
fidence threshold and various characteristics of the graph. The number of opinions 
coexisting at equilibrium is then investigated in a probabilistic dynamic approach 
for the stochastic process starting from a random configuration. It is proved that, 
for large confidence thresholds, the probability of an ultimate consensus on any 
connected graph is strictly positive whereas, with high probability, any fraction of 
the opinions is retained by the dynamics in one dimension provided the confidence 
threshold is small enough. 
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1. Introduction 

Opinion and cultural dynamics are driven by social influence, which is the ten- 
dency of individuals to become more similar when they interact. At least when the 
number of interacting agents is finite and the network of interactions is connected, 
models including this aspect, such as the voter model introduced independently by 
Clifford and Sudbury (1973) and Hollcy and Liggett (1975), usually predict conver- 
gence to a global consensus. However, differences between individuals and groups 
persist in the real world. In his seminal paper Axelrod (1997), political scientist 
Robert Axelrod explains the diversity of opinions and cultures as a consequence 
of homophily, which is the tendency to interact more frequently with individuals 
which arc more similar. In the Axelrod model, actors are characterized by a finite 
number of cultural features. In Axclrod's own words, the more similar an actor is 
to a neighbor, the more likely that actor will adopt one of the neighbor's traits. 
Interactions take place on a finite connected graph G with vertex set V and edge 
set E. Each vertex x is characterized by a vector of n cultural features, each of 
which having m possible states, 

c(x) = {fi{x),...,f n (x)) where fj(x) € {1, 2, . . . ,m} for j = 1, 2, . . . , n. 

At each update, a vertex, say x, is picked uniformly at random along with one 
of its neighbors, say y. Then, with a probability equal to the fraction of features 
vertices x and y have in common, one of the features for which states are different 
(if any) is selected uniformly at random, and the state of vertex y is set equal to 
the state of vertex x for this cultural feature. Otherwise nothing happens. The 
system evolving in continuous-time with each ordered pair of neighbors becoming 
active at rate 1 can be modeled formally by the continuous-time Markov process 
whose state at time t is a function ct that maps the vertex set of the graph into the 
set of cultures {1, 2, . . . , m}™, and whose dynamics are described by the Markov 
generator La defined on the set of cylinder functions by 

!{/<(*) ± fi(y)} [g(cU y ) - g(c)] 

where y ~ x means that x and y are connected by an edge, 

c l^ y {y) = {h(y),---,fi-i{y),fi(%)Ji+i(y),---Jn(y)) 

and c l x ^ y {z) = c(z) for all z ^ y, and 

n 

f(x,y) = - E HM*) = fM} 

3=1 

is the fraction of traits vertices x and y share. Due to the finiteness of the graph, 
the system is always driven to an absorbing state: either an ordered configuration 
in which all the vertices share the same culture, or a disordered configuration in 
which different cultures coexist. The phase transition between these two regimes, 
which depends qualitatively and quantitatively upon the structure of the graph, 
the number of cultural features, and the number of states per feature, has been 
extensively studied by social scientists as well as statistical physicists based either 
on numerical simulations (see Gonzalez-Avella et al., 2005; Klemm et al., 2003) 
or simple mean field treatments (see Castellano et al., 2000; Vazquez and Redner, 
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Figure 1.1. Snapshots of the process at time 10,000. 



2007; Vilone et al., 2002) ignoring the structure of the network, and we refer the 
reader to Section IV. A of Castellano et al. (2009) for a review. 
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While there is a lack of rigorous mathematical results for the Axelrod model, 
we prove analytically part of the results previously observed through numerical 
simulations as well as new results about the maximum number of opinions that can 
be supported by a graph for another variant of the voter model closely related to 
the Axelrod model which includes both social influence and a confidence threshold 
e > 0, i.e., pairs of actors whose opinion distance exceeds this threshold do not 
trust each other enough to mimic each other. The network of interactions is again 
described by a finite connected graph but each vertex is now characterized by an 
opinion taking values in [0,1]. The opinion dynamics are similar to that of the 
voter model except that a vertex can mimic a neighbor only if the opinion distance 
between the vertex pair does not exceed the threshold e. The opinion dynamics 
are formally described by the continuous-time Markov process whose state at time 
t is a function rj t that maps the vertex set into the opinion space [0,1], and whose 
Markov generator Ly is defined on the set of cylinder functions by 

Lvg(v) = Hlviy) - < t} [givx^y) - g(v)} 

x£LV y^x 

where r\ x ^>y is obtained from r\ by assigning the value rj(x) to vertex y. Note that, 
when the confidence threshold is set to e = 0, any configuration is an absorbing 
state regardless of the cardinality and topology of the network of interactions: the 
dynamics are frozen so that consensus are not possible. In contrast, when e = 1, 
one recovers the (multitype) voter model, in which case the duality with coalescing 
random walks implies that on any finite connected graph a total consensus is reached 
with probability 1. For results about consensus times of the voter model on the 
torus, we refer the reader to the article of Cox (1989). When e £ (0, 1), the dynamics 
become more complex. Intuitively, agreements should emerge from the repeated 
interactions among agents, so one expects that the probability of a consensus is 
nondccreasing with respect to both the confidence threshold and the connectivity 
of the graph. This picture is supported by Figure 1.1 that shows realizations of the 
process on a 400 x 400 lattice with periodic boundary conditions for different values 
of the confidence threshold e. Numbers between parenthesis at the bottom of each 
picture, which represent the number of opinions at time 10,000, strongly suggest 
that the expected number of opinions at equilibrium is nonincreasing with respect 
to e. It turns out that the probability of a consensus depends upon the initial 
configuration as well: we will prove that, whereas absorbing states with a large 
number of opinions can be artificially constructed in a deterministic fashion, starting 
from a random configuration in which the initial opinions are independent and 
uniformly distributed, the dynamics tend to greatly decrease the number of opinions 
present in the system, which is again supported by the numerical simulations of 
Figure 1.1. 

Before stating our results, recall that throughout this article G = (V, E) denotes 
a finite connected graph with N vertices. We employ the notation ~ for the binary 
relation on the vertex set that indicates that two vertices are neighbors, i.e., con- 
nected by an edge. Finally, given e £ [0, 1], we let P e denote the law of the process 
with parameter e starting from independent opinions chosen uniformly at random 
in the interval [0, 1], and let E e be the corresponding expected value. 

Number of opinions at equilibrium static approach. The first step 
is to look at the number of opinions that may coexist at equilibrium. Note that 
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configurations in which all the agents share the same opinion, that we call consensus, 
are absorbing states, so the minimum number of opinions at equilibrium is trivially 
equal to 1. Let /x e (G) denote the maximum number of opinions that may coexist at 
equilibrium, a quantity that we shall call the e-opinion index, or simply the opinion 
index, of the graph. A key tool to investigate the opinion index is to observe that 
the stationary distributions are exactly the ones supported on the set of absorbing 
states. This is intuitively obvious since the underlying graph is finite, but will 
be proved rigorously in the next section by applying the martingale convergence 
theorem. The problem thus reduces to finding all the static configurations of the 
process, namely the configurations n : V — > [0, 1] such that 

\t](x) — Tj(y)\ > e or rj(x) = r/(y) for each edge (x,y) G E. 

The previous condition implies that, for any given graph G, the index fi e (G) is 
nonincreasing with respect to the confidence threshold e. Moreover, we observe that 
the condition is always satisfied when the threshold e = 0, therefore Mo(G) = N. 
In contrast, when e = 1, the process reduces to the multitype voter model that 
reaches eventually a consensus, therefore /ii(G) = 1. 

For intermediate values of the confidence threshold, lower bounds of the opinion 
index can be expressed as a function of the chromatic number x(G) of the graph. 
Recall that a graph is said to be c-colorable if one can color its vertices using 
at most c colors such that no two adjacent vertices receive the same color. The 
smallest number of colors needed to color a graph is called its chromatic number. 
Provided e < 1, if a large graph can be colored using only few colors then one 
can construct absorbing states in which a large number of vertices (those colored 
with the same given color) have distinct opinions arbitrarily close to each other. In 
contrast, if the chromatic number is large, then one expects the number of opinions 
at equilibrium to be rather small. Precisely, by letting [a] be the least integer not 
less than a, we have the following theorem. 

Theorem 1.1. If e < 1 and x = x(G) is the chromatic number of G then 
€<{ X -iy 1 Me (GQ = card(V) = N. 

e^ix-l)- 1 =► M £ (G) > max(riV/xl +l,re- 1 D- 

Theorem 1.1 combined with results from the theory of graph coloring gives us 
insight into how the topology of the graph may affect its opinion index. It is not 
the purpose of this article to review all the results of graph coloring, so we only 
give few applications. 

Corollary 1.2. If G is a planar graph and e < 1/3 then Me(G) = N. 

Proof: This is a straightforward consequence of Theorem 1.1 and the four color 
map theorem which states that any planar graph is 4-colorable. □ 

Corollary 1.3. If G is a triangle-free planar graph and e < 1/2 then Me(G) = N . 

Proof: This follows from Theorem 1.1 and Grotzsch's theorem which states that 
every triangle-free planar graph is 3-colorablc. □ 

Corollary 1.4. Assume that 1/2 < e < 1. Then, Me(G) — N if and only if the 
graph G is bipartite. 

Proof: If G is bipartite then it is 2-colorablc so Mc(G) = N by Theorem 1.1. To 
prove the reverse, assume that the graph G is not bipartite. Then it contains an 
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odd cycle. Let 2n + 1 be the length of this cycle. Without loss of generality, we 
may assume that at least n + 1 vertices in this cycle have opinion in [0, 1/2], which 
induces the existence of two adjacent vertices with opinion in [0, 1/2]. Since e > 1/2, 
for the configuration to be an absorbing state, these two vertices must share the 
same opinion from which it follows that Ht(G) < N. □ 

In general, Theorem 1.1 gives interesting results when considering graphs with 
small chromatic numbers, which gives rise to absorbing states with a large number 
of opinions. 

To understand the process on finite graphs with a large chromatic number such 
as complete graphs for which x(G) = N, we now give an upper bound of the opinion 
index /x e (G). This upper bound is computed by considering a certain decreasing 
sequence of subgraphs in which each member is obtained from the previous one 
by removing its largest clique. Recall that a clique in a graph is a set of pairwise 
adjacent vertices inducing a subgraph which is a complete graph. Let G be a finite 
graph, and W C V its largest clique. We introduce the following notations: 

- The number of vertices in W is denoted by w(G). 

- The subgraph of G induced by the vertex set V\W is denoted by ir(G). 
When the graph has more than one maximal clique, the subgraph 7r(G) is not 
uniquely defined, and the upper bound in the next theorem is obtained by taking 
the minimum over all the possible choices of the sequence of subgraphs. 

Theorem 1.5. Assume that G = {V, E) is a finite connected graph. Then, 

/i e (G) < min (min(w(G), [e- 1 ]) + /i e (7r(G))) 

choices 

< min (min(w(G), [e- 1 ]) + ••• + mm(w(ir If - l (G)) t le' 1 ])). 

choices 

The sum is stopped at N — 1 because 7r l (G) = whenever i > N. As previously 
explained, while Theorem 1.1 gives interesting lower bounds for graphs with small 
chromatic number, Theorem 1.5 gives interesting upper bounds for graphs with 
large connectivity. In particular, the result turns out to be optimal in the case of 
the complete graph for which /x £ (G) = min(iV, [e -1 ]). 

Number of opinions at equilibrium dynamic approach. Wc now in- 
vestigate the limiting number of opinions v t (G) that coexist at equilibrium in a 
dynamic context, namely for the process starting from the configuration in which 
the initial opinions rjo(x), x € V, arc independent and uniformly distributed in the 
interval [0,1]. It is important to point out that v t {G) is a random variable while 
the opinion index ^ £ (G) is a deterministic upper bound. 

For any finite connected graph, one has v t {G) = ^ C (G) when the confidence 
threshold is either equal to e — or equal to e = 1. The opinion index, however, 
might be rather crude for intermediate values of the confidence threshold e. This is 
due to the fact that stable configurations with /z e (G) opinions are often artificially 
constructed in a deterministic way and have in fact a small probability. For instance, 
the graph used to produce our simulation pictures is an example of bipartite graph 
therefore /U e (G) = 400 2 for all e < 1 according to Corollary 1.4. In contrast, 
our simulation results indicate that for reasonably large values of the confidence 
threshold, most of the opinions are lost. In support to this picture, our next result 
gives a general lower bound for the probability of a consensus that holds regardless 
of the topology of the graph. 



Opinion dynamics with confidence threshold 



7 



Theorem 1.6. Assume that e > 1/2. Then, 

P e (consensus) = P £ (y € {G) = 1) > 2e - 1. 

Theorem 1.6 follows from an application of the optional sampling theorem that 
shows that the set of agents whose opinion lies outside (1 — e, e), that can be seen as 
"extremist" agents, hits the empty set with probability 2e — 1. The proof uses the 
fact that all "centrist" agents, with opinion in the interval (1 — e, e), may interact 
with all "extremist" agents. Once all agents are "centrist" the system evolves 
according to a voter model so the population reaches a consensus. 

We now look at the long-term behavior of the process when e > is small. 
Observe first that, when the degree of the graph is uniformly bounded by a positive 
constant, say K, the number of edges in the graph is at most NK, hence 

P e {v e {G)^N) < P e (\r)o(x)-r) (y)\ < e for some (x,y) e E) 

< P A\Vo{x) -Vo(y)\ < e) < 2e card(£) < 2e NK. 

In particular, with probability close to 1, the number of opinions coexisting at 
equilibrium is equal to the number N of vertices provided e = o(l/N). Note that 
the upper bound above depends on the number of vertices, which is necessary for 
the number of opinions coexisting at equilibrium to be exactly equal to N, or 
equivalently, for the initial state to be an absorbing state. This is due to the fact 
that, when e > docs not depend on N, with probability close to 1 when N is 
large, the number of opinions at equilibrium will be strictly less than N since one 
can find with high probability at least one edge whose endpoints arc initially within 
opinion distance e. Such edges, however, are minoriy, so we conjecture that, with 
probability close to 1 when N is large, any given fraction c < 1 of the opinions can 
ultimately be retained by the dynamics provided e lies below a positive constant 
that only depends upon c and the local topology of the graph. 

Conjecture 1.7. For any c < 1, there exists eo(c,K) £ (0, 1) such that, uniformly 
over all graphs with degree at most K and number of vertices N , we have 

P e {Ve{G) <cN) < Cexp(--yN) for all e < e (c, K) 

and suitable constants C > and 7 € (0, 00) that only depends on K and e. 

Using an analogy with bond percolation, any configuration of opinions can be 
thought of as inducing a partition of the vertex set into clusters, where two vertices 
x and y belong to the same cluster if and only if there is a graph path in G from 
x to y with e-agreement along it. Peierls argument implies that, when e is suffi- 
ciently small, the size of a typical cluster at time decays exponentially, therefore 
the initial number of clusters can be made larger than cN with high probability. 
Then, under the assumption that the opinion distance between any two vertices 
belonging to adjacent clusters initially exceeds e, all the clusters are closed under 
the dynamics: an opinion can at most spreads within its parent cluster. In partic- 
ular, the ultimate number of coexisting opinions dominates the initial number of 
clusters, which suggests the conjecture. This argument breaks down if the initial 
configuration contains at least one cluster that is not closed under the dynamics, 
which occurs with probability close to 1 due to the presence of arbitrarily large 
clusters at time that contain a wide range of opinions, however the analogy with 
bond percolation supported by numerical simulations gives us the following insight 
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into the geometry of the ultimate configuration when the confidence threshold is 
small: most opinions that persist have only few representatives, those that originate 
from clusters that are initially closed under the dynamics, and few opinions have 
a significantly larger number of representatives, those that originate from clusters 
that are not closed under the dynamics. Based on a fairly different approach far 
from percolation theory, the conjecture can be established for path graphs, i.e., two 
of the vertices have degree 1 and all others degree 2. 

Theorem 1.8. Assume that G = {V,E) is a path. Then 

P e (i/ e (G) < (1 - ci e)N) < exp(-c 2 eN) 

for some constants ci, C2 > that do not depend on N and e. 

The conjecture when the graph is a path follows by taking e = (1 — c)/c\. In 
order to prove this result, we will look at the process coupled with the opinion 
dynamics that keeps track of the disagreements along the edges rather than the 
actual opinion at each vertex. It will be proved that the number of opinions on a 
path graph, and more generally the number of connected components in which all 
vertices share the same opinion on a tree, is nearly equal to the number of edges 
along which a disagreement persists, therefore the objective will be to bound the 
ultimate number of edges whose endpoints agree. When the graph is a path, as 
opposed to a tree with larger degree or a graph including loops, the dynamics on the 
edges can be expressed in a simple manner, which is the key to finding a suitable 
upper bound. This will be done by ultimately ignoring the spatial structure and 
exhibiting a connection with a simple urn problem. 

2. Proof of Theorems 1.1 and 1.5 

To study the opinion dynamics and characterize their equilibriums, it is conve- 
nient to construct the process graphically using an idea of Harris (1972). We first 
orient each edge e £ E arbitrarily. Each edge is further equipped with a Poisson 
process with parameter 2 whose nth arrival time is denoted by T„(e) together with 
a collection of independent coin flips by letting 

P(J7„(e) = +1) = P(£/„(e) = -l) = 1/2 for all n > 1. 

Poisson processes and collections of coin flips attached to different edges are inde- 
pendent. The process starting from any initial configuration is constructed from 
this graphical representation in the following way. Let e = (x, y) being oriented 
from x to y, and let a and b denote the opinion at x and y at time T n (e). 

(1) If |a — b\ > e then nothing happens at time T ra (e). 

(2) If |a — b\ < e and U n (e) = +1 then the opinion of y jumps from b to a. 

(3) If \a — b\ < e and U n (e) = — 1 then the opinion of x jumps from a to b. 

Any absorbing state of the process induces a stationary distribution. Since the 
vertex set is finite, it should be clear that any stationary distribution is in turn 
supported on the set of absorbing states, which we now prove rigorously using the 
martingale convergence theorem. 

Lemma 2.1. Assume that G = (V,E) is finite. Then, any stationary distribution 
is supported on the set of absorbing states. 
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Proof: For x £ V, let 0t(x) be the set of vertices with opinion 770(2;) at time i, that 
is the set of vertices whose opinion at time t originates from x at time 0, i.e., 

Q t (x) = {y eV :r) t (y) =r, (x)}. 

The set-valued process 0t(a;) may only evolve at times T n (e) when the oriented 
edge e connects a vertex y £ @t(x) to a vertex z £ V \ 0t(x). Without loss of 
generality, set e = (y, z) is oriented from vertex y to vertex z. 

(1) If \rjt(y) — ?lt( z )\ > £ then nothing happens at time T n (e). 

(2) If \r] t (y) — T} t (z)\ < e then 9 t {x) = card(0 t (x)) jumps to 9 t (x) + U n (e) at 
the arrival time T n (e). 

Since the random variable U n (e) takes values +1 and —1 with equal probabilities, 
this implies that the number of vertices with opinion 770(2;) is a martingale. In 
particular, the martingale convergence theorem implies the existence of a random 
variable 9 oa (x) £ {0, 1, ... , N} such that 

lim 9 t (x) — 9oc(x) with probability 1. 

t— ^00 

Since the state space of 9 t (x) is finite, there exist a stopping time T x almost surely 
finite and a random subset of V that we denote by ©00(2?) such that 

9 t (x)= 9 00 (x) and Q t (x) = Q 00 (x) for all t >T X . 

Finally, set T = max{T x : x £ V}. Then, T is almost surely finite and 

Q t (x) = Qooix) for all x £ V and t > T 

indicating that, starting from any initial configuration, the process eventually gets 
trapped into an absorbing state. This completes the proof. □ 

Lemma 2.2. Assume that G\ = (V,Ei) and G 2 = (V,E 2 ) with E\ C E 2 - Then, 
we have /i e (Gi) > /j e (G2). 

Proof: By Lemma 2.1, there exists a configuration 77 with (i e (G2) distinct opinions 
which is an absorbing state of the process evolving on G-2- In particular, we have 

\r)(x) — i](y) I > e or i](x) = 77(77) for each edge (x, y) £ E 2 - 

This property is true for the edges of Gi since E\ C E 2 , which implies that 77 is 
also an absorbing state for the process evolving on G\. In conclusion, we have the 
inequality fx e (Gi) > [i e (G 2 ). □ 

Lemma 2.3. Assume that G = (V, E) is a finite connected graph. Let {V\, V 2 } be 
a partition of V , and let Gi = (Vi, E{) be the subgraph induced by Vi. Then, 

He(G) < Me(Gl) + /i £ (G 2 ). 

Proof: Since Ei C E, any absorbing state 77 : V — > [0, 1] for the process evolving on 
the graph G induces an absorbing state r\i for the process on Gi. In particular, there 
exist two absorbing states for the processes evolving on Gi and G 2 , respectively, 
such that the sum of the number of distinct opinions in each configuration is larger 
than /J C (G). The result follows. □ 

Lemma 2.4. If G = (V,E) is a complete graph then /i e (G) = min(iV, [e" 1 ]). 
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Figure 2.2. \ = 3 and a = 0.001 



Proof: Let J = \e and let 77 : V — > [0,1] be a configuration with at least J + 1 
opinions. Then, there exist vertices xq,x\, . . . ,xj such that 

< 77(2.0) < V{xx) < ■■■ < r){xj) < 1. 
In particular, we can find j G {0,1,..., J — 1} such that 

177(^+0-77(^)1 < J- 1 = l/le- 1 ] < e. 

Since vertices Xj and Xj+i are neighbors (complete graph), the previous inequality 
implies that the configuration 77 is not an absorbing state. It then follows from 
Lemma 2.1 that J + 1 opinions cannot coexist at equilibrium so 

H*(G) < J = le- 1 ]. 

To conclude, it suffices to construct an absorbing state with min(_V, J) distinct 
opinions. Assume first that min(7V, J) = J and set 

Tiixj) = min(j/(J - 1), 1) for j = 0, 1, . . . , N - 1 

where Xo, Xi, . . . , Xn— 1 denote the vertices. Since e < ( J — 1) _1 the configura- 
tion thus defined is an absorbing state with J distinct opinions. Finally, when 
min(_V, J) = N we have e < (N — 1) _1 and the same argument implies the exis- 
tence of an absorbing state with N opinions. □ 

With Lemmas 2.1-2.4 in hands, we can now prove Theorems 1.1 and 1.5. 

Proof of Theorem 1.1: Assume first that e < (x — and fix a coloring of the 
vertices using exactly x colors, say 1,2, ... ,x, such that no two adjacent vertices 
receive the same color. Also introduce a total order on the vertex set by setting 

V = {xi,x 2 ,...,x N } 

and fix a > sufficiently small such that e + 2Na < (\ — l) -1 . If vertex x_ has 
received color j ^ \ then we set 

r](xi) = (i- 1) • (X- l) -1 +ia 
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whereas if vertex Xi has received color j = x then we set 

Vfa) = U - !) ' (x - - ia = l-ia. 

See Figure 2.2 for an example of construction on a 3-colorablc graph. This induces 
a configuration with N distinct opinions. Moreover, if vertices xi and Xj have 
received different colors then 

\r)(xi)- > (x-l)- 1 -2A^a > e. 

Since no two adjacent vertices receive the same color, the configuration r\ is an 
absorbing state, which establishes the first part of Theorem 1.1. We now assume 
that e > (x — I) -1 ! an d consider the same coloring as before. Let Nj denote the 
number of vertices with color j. Since 

N l +N 2 + ---+N x = N and Nj € Z + for j = 1, 2, . . . , X 

there is jo such that Nj > \Njx\ . Let a > such that e + Na < 1, and define 

f ia for all xi 6 V with color jo 
\ 1 for all Xi £ V with color j ^ jo 

Since no two adjacent vertices have color jo and 1 — Na > e, this defines an 
absorbing state with Nj + 1 > \N/x \ + 1 opinions. To prove that /U e (G) > [e _1 l, 
we consider the complete graph G with the same vertex set V as G, and apply 
Lemmas 2.2 and 2.4 to get 

/i £ (G) > Me(G) = minCTVje- 1 ]) = [e" 1 !. 

The last equality follows from the fact that [e~ 1 ]<X — 1 < N. □ 

Proof of Theorem 1.5: Let V\ = W and V2 = V \ W, where W C V is a maximal 
clique in the graph G. Then, the subgraph Gi induced by V\ is the complete 
graph with w(G) vertices, and the subgraph induced by V% is Gi = 7r(G) therefore, 
combining Lemmas 2.3 and 2.4, we obtain 

/ie(G) < /Ue(Gi) + M e (G 2 ) < min(w(G), Te- 1 ]) + ^(tt(G)). 

Since this holds for any choice of W, the result follows by induction. □ 

3. Proof of Theorem 1.6 

Recall that e > 1/2. We introduce 

6 t = {x G 1/ : Jjt(x) i (1 - e, e)} and t = card(6 t ). 
Using the notations of the proof of Lemma 2.1, we have 

@ t = U e *( x ) and * = 6t ^- 

x£©o i£0o 

The proof of Lemma 2.1 implies that 9 t (x) is a martingale so the process 9t also 
is a martingale. Therefore, by the martingale convergence theorem, there exists a 
random variable 9^ such that 

lim 9t = 9oo with probability 1. 
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If Ot £ {0, N} then there exists (x, y) G E such that ie6 t and y ^ 9 t . Since any 
opinion in the interval (1 — e, e) is within distance e of both endpoints and 1, we 
can conclude that vertices x and y have the following three properties: 

\Vt(x) ~Vt(y)\ < e Vt(x) ^ r)t(y) and a; ~ j/, 

which implies that r/ t is not an absorbing state. Since by Lemma 2.1 the process 
converges to an absorbing state, we deduce that the random variable 9^, takes 
values in {0,iV}. Let T be the first time the process hits an absorbing state. The 
optional sampling theorem implies that 

E e (9 T ) = E e (6 ) = NxP e (xeG ) = 2(1 -e)N 

= £ e (M = 0xP e (6 oo =0)+N xP e (9 00 =N), 

from which it follows that P e (Boo = 0) = 1 — 2(1 — e) = 2e — 1. To conclude, we 
observe that on the event that 6*00 = 0, all the opinions present in the system after 
time T are within distance 2e — 1 < e of each other so the process evolves according 
to a voter model. In particular, 

P e (v e (G) = l) = P e (consensus) > P e (6 oo = 0) = 2e-l. 

This completes the proof of Theorem 1.6. 

4. Proof of Theorem 1.8 

The key to proving Theorem 1.8 is to investigate a Markov process coupled with 
the opinion dynamics that keeps track of the disagreements along the edges of the 
graph rather than the actual opinion at each vertex. The state at time t can be 
viewed as a weighted graph where each edge is assigned a weight that measures 
the opinion distance between its endpoints. The process evolves until each edge 
has either weight zero or an absolute weight larger than the confidence threshold e, 
which corresponds to an absorbing state. The strategy is to first exhibit a relation- 
ship between the ultimate number of coexisting opinions and the ultimate number 
of edges with weight zero, and then prove that the probability that the fraction of 
edges with weight zero exceeds a certain threshold decreases exponentially with the 
number of vertices. 

Dynamics on a general finite connected graph. To define dynamics on the 
edges of the graph coupled with the opinion dynamics, we first equip the vertex set 
with a total order relationship by setting V = {x\,x%, . . . ,xn}, which implicitly 
induces an orientation of the edges. That is, we think of each edge of the graph as 
being oriented with 

e = (xi, xj) 6 E being oriented from Xi to Xj if and only if i < j. 

By convention, if e = (x, y) is oriented from x to y, we write e\ = x and e2 = y. 
Then, the opinion dynamics on the vertex set described by the process r\ t along 
with an arbitrary orientation of the graph naturally induce dynamics on the edges 
by letting 

6(e) = Vtfa) -*7t(ei) for all e e E. 
The stochastic process thus defined is a continuous-time Markov process whose state 
space includes all mappings of the edge set E into the interval [—1,1]. Moreover, 
since a vertex adopts the opinion of a given neighbor at rate 1 if and only if the 
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opinion distance between both vertices is less than e, the dynamics on the edge set 
are described by the Markov generator Le defined by 

WO = E l{l£(e)l<e}b(£+e)+ffK-e)-2fl(0] 

where £ +e and £_ e are the configurations on the edges given by 

, „ = | C(e') + £(e) when e' a = e 2 

\ C( e ') - £( e ) when e' 2 = e 2 

f £(e') - £(e) when ei = ei 

el6j when e 2 = ei 

while £+ e and £_ e coincide with £ otherwise. Note that £+ e (e) = £- e (e) = due 
to an agreement between the endpoints of e. Thinking of £(e) as a weight, cither 
a positive or a negative weight, of edge e, the evolution rules can be described 
informally as follows. Independently and at rate two, the edge e = (x, y) chooses 
one of its endpoints, say x, uniformly at random. If the absolute value of the weight 
of edge e is less than e, then the weight of each of the edges incident to x is updated 
by summing or subtracting, depending on the orientation of the graph, the weight of 
e, which results in particular in the weight of e becoming zero. Otherwise, nothing 
happens. Note that if originally edge e has weight zero then the previous rule has 
no effect on the configuration. In particular, a configuration such that each edge 
has either weight zero or an absolute weight larger than e is an absorbing state, 
and vice versa. This motivates the following definition. 

Definition 4.1. For j = 1, 2, . . . , J = [e" 1 ] , an edge e G E is said to be 

(1) empty at time t whenever £t(e) = 0, 

(2) of type j at time t if its absolute weight is such that (j — 1) e < |£t(e)| < je. 

In the previous definition, weights equal to a multiple of e different from are 
ignored because, as the difference between two numbers taken from a set of inde- 
pendent continuous random variables, they appear with probability 0. 

Dynamics on a path-like graph. When the graph is a path, the vertex set 
is equipped with a natural order relationship such that 

xi ~ Xj if and only if |i — j\ = 1. 

For simplicity, we shall use this order relationship to study the process on a path as 
it implies that two edges interacting result in one empty edge and one edge whose 
weight is obtained by summing, rather than subtracting, the weights of the edges: 

£+ e (e') = £_ e (e') = £(e) + £(e') for all e, e' G E with e n e! + 0. 

To describe the dynamics, it is also convenient to add two vertices, say xq and 
xn+i, which are connected by a virtual edge to x\ and xjy, respectively. Then, the 
process on a path can be simply described as follows: each weight on a non-virtual 
edge with absolute value less than e is displaced at rate 2 to one of the adjacent 
edges chosen uniformly at random. Note that the weight of both virtual edges is 
unimportant since it has no effect on the weight of the other edges. 
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Number of opinions on trees and path-like graphs. As previously men- 
tioned, the number of opinions and the number of empty edges are closely related. 
More precisely, we prove that for a path-like graph the sum of these two quantities 
is always equal to the number of vertices. We first prove one inequality which holds 
for general connected graphs with no cycles. The result is sharp in the sense that 
it fails for connected graphs with cycles. 

Lemma 4.2. Assume that the connected graph G = (V, E) is a tree. Then 
card{f? t (» : x G V} + card {e G E : £ t (e) = 0} < N. 

Proof: As a subgraph of a tree, the set of empty edges at time t along with all TV 
vertices of the graph is a forest. Since two vertices belonging to the same tree in 
this forest are connected by a path of empty edges, they share the same opinion. 
In particular, the total number of opinions is at most equal to the number of trees 
in the forest, say T. Using in addition that for any tree the number of vertices 
minus the number of edges equals one, we also have that N is equal to the number 
of empty edges plus the number of trees T. In summary, we have 

c&vd{r] t {x) : x G V} < T and card {e 6 E : &(e) = 0} = N-T 

from which the lemma follows. □ 

Having a graph with a self-avoiding loop of length m and assuming that all the 
vertices in this loop share the same opinion but that all the other vertices have 
different opinions gives 

card {rjt(x) : x E V} = N — m + 1 and card{e G E : £t(e) = 0} = m, 

so the assumption that the connected graph is a tree in the previous lemma is 
necessary. The next result states that the inequality in Lemma 4.2 becomes an 
equality on a path-like graph, which is again sharp in the sense that this does not 
hold if at least one vertex has degree 3. 

Lemma 4.3. If G — {V,E) is a path and initial opinions are all distinct then 

card {774(2;) :x&V} + card{e G E : £ t (e) = 0} = N. 

Proof: Since the initial opinions are all distinct, 

card {770(2;) : x 6 V} = N and card{e G E : £ (e) = 0} = 0. 

Hence, the property to be proved is true at time 0. To prove that it holds at any 
time, we introduce the following two definitions. We say that an opinion is lost at 
time t if the last representative of an opinion mimics one of its neighbors at time t, 
which is equivalent to 

card {r] t (x) : x E V} = card{ry t _(x) : x € V} — 1. 

Similarly, we say that a weight is lost at time t if an update occurs at vertex x 
at time t when all the edges incident to x have weight different from 0, which is 
equivalent to 

card {e G E : £ t (e) = 0} > card {e G E : (e) = 0} + 1. 
Now, assume that an opinion is lost at time t. Then, there exists x G V such that 

(i) r/ t -(x) 7^ Vt-( z ) for all z 7^ x and (ii) rj t (x) = r] t (y) for some y ~ x. 
Applying property (i) above for all z ~ x and letting e = (x,y), it follows that 
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Figure 4.3. Realization on a tree with \E — A\. \E — B\, \E — C\, 
\D-A\ < e < \E-D\. 

(1) £t_(e') ^ for all edges e' £ E with one cndpoint at x and 

(2) Z t (e) = r h (y)-r h (x)=0, 

which implies that a weight is lost at time t. Also, we observe that at each update 
of the process one edge becomes empty. Because on a path graph the weight of 
at most two edges can change simultaneously, it follows that the number of empty 
edges cannot decrease. In conclusion, since each time an opinion is lost a weight is 
also lost, and the number of empty edges cannot decrease, the number of opinions 
lost up to time t is smaller than the number of weights lost up to time t which, 
together with the fact that the initial number of opinions is N, implies that 

card{?7 t (x) : x E V} + card{e G E : £t(e) = 0} > N. 

The reverse inequality follows from Lemma 4.2. This completes the proof. □ 

Note that the assumption that the graph is a path-like graph is necessary as shown 
by the simple counter-example of Figure 4.3. In this example, the opinion dynamics 
start with five independent uniformly distributed opinions satisfying the condition 
in the caption. Dashed lines refer to empty edges, continuous lines to edges of type 
1, and dotted lines to edges with absolute weight more than e. The final configu- 
ration on the right-hand side is an absorbing state with only two empty edges and 
two distinct opinions while the graph has five vertices. Note also that the previous 
lemma indicates that, on a path, an opinion is lost at time t if and only if exactly 
one weight is lost at time t, therefore when the weight of an edge of type 1 is added 
to the weight of another edge, the resulting weight is almost surely different from 
0. More generally, 

6(e) + ft(e') = implies that £ t (e) = &(e') = for all e, e' e E. 

This result can be proved directly without invoking Lemma 4.3 by using the one- 
dimensionality of the graph and the fact that each weight is the difference of two 
values chosen from a finite set of independent continuous random variables. 

Connection with a simple urn problem. We assume from now on that the 
graph is a path-like graph, and focus on the proof of our last result, Theorem 1.8. 
To deduce the theorem from Lemma 4.3, it suffices to find a suitable upper bound 
for the ultimate number of empty edges. The idea is to study the maximum number 
of weights that are lost before absorption over all the possible realizations of the 
process by ignoring the spatial structure, i.e., at each update, an edge of type 1 
along with another edge are chosen uniformly at random to interact. Let 
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Box Box 1 Box 2 Box 3 

Figure 4.4. Schematic illustration of strategy S 

denote the number of edges of type j, and X t (0) the number empty edges at time 
t. The first step is to bound the initial number of edges of any type, as shown in 
the following lemma. 

Lemma 4.4. Let J = [e^ 1 ] and M = |"4eAT| - 1. Then, 

P e (X (j)>M) < cxp(-eTV) for all j = 1,2,..., J. 

Proof: Wc first fix j and observe that, since the random variables r)o(x), x G V, are 
uniformly distributed in the interval [0, 1], we have 

P e ((j - l)e < |£ (e)| < je | %(ei) = a) = P t ((j - l)e < |%(e 2 ) - a\ < je) 

< Pc(a+ (j - 1) e < 77o(e 2 ) < a + je) + P e (a- je < ?7 (e 2 ) 

< a-(j-l)e) < 2e 

for all e = (ei,e2) £ -E and a € [0,1]. This, together with the fact that the 
graph does not contain any loops and that the initial opinions arc independent, 
implies that the initial number of edges of type j is stochastically smaller than the 
binomial random variable X ~ Binomial (N — 1, 2e). Note that the first parameter 
corresponds to the number of edges of the graph which, in the case of a path and 
more generally a tree, is equal to the number of vertices minus one. In particular, 
it follows from large deviation estimates for the Binomial distribution that 

P £ (X (j) > M) < P(X> [4eAn - 1) 

< cxp(-27Ve 2 /(2e)) < cxp(-eiV). 

This completes the proof. □ 

Now, wc observe that an interaction between an edge of type 1 and an empty edge 
does not affect the number of edges of each type. Therefore, since we now ignore the 
structure of the network, we shall also ignore such events, i.e., each realization of 
the process corresponds to a random sequence of edges of type 1 becoming empty 
interacting with non-empty edges. In particular, using that each time a weight 
is lost, the number of empty edges increases by 1 according to Lemma 4.3, the 
ultimate number of empty edges is also equal to the number of updates before 
absorption of the process. Also observe that an interaction between an edge of type 
1 and an edge of type j results in the first edge becoming empty and the second 
edge becoming of type either j — 1 or j or j + 1. 

To keep track of the process X t we now play the following game: we consider a 
set of J + 1 boxes, labelled from box to box J, with box j ^ containing initially 
a certain random number of balls and box being initially empty. Then, at each 
time step, we do the following. 
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(1) Edge becoming empty: move a ball from box 1 to box 0, 

(2) Second interacting edge: choose a box j ^ uniformly at random from the 
set of all the non-empty boxes, 

(3) Updated weight: move a ball from this box to one of the three boxes j — 1, 
j or j + 1 chosen uniformly at random with probability 1/3. 

The game stops when box 1 is empty. We now think of the number of balls in box 
j =/= as the number of edges of type j, and the number of balls in box as the 
number of empty edges, and observe that standard coupling arguments imply that 
the number of steps to empty box 1 increases stochastically if wc add more balls 
at time 0. In particular, the reasoning above implies that, given that the initial 
number of edges of each type is at most M = |~4eiV] — 1, the ultimate number 
of empty edges is bounded by the maximum number of steps over all the possible 
realizations of the game to empty box 1 when starting with M balls in each box 
j ^ 0, which is reached for the strategy that will keep box 1 non-empty as long as 
possible. Since in addition at least j — 1 steps are required to move a ball from box 
j to box 1 (exactly j — 1 steps by always moving the ball to the left), the maximum 
number of steps to empty box 1 over all the possible realizations of the game is 
reached when following the deterministic strategy S that consists at each step in 

(1) Moving a ball from box 1 to box 0, 

(2) Choosing the non-empty box with the lowest label j > 2, 

(3) Taking a ball from this box and moving it to box j — 1. 

Letting Y n (J) denote the deterministic number of balls in box j at step n under 
strategy S when starting with exactly M balls in each box j ^ 0, we obtain the 
following values: 

Y n (l) = M and F„(2) =M-n 

for all n = 0, 1, . . . , M, and 

Y n+M (l) = M — \n/2\ , Y n+M (2) = l{n odd} and Y n+M (3) = M - \n/2\ 

for all 77. = 1, . . . , 2M. Balls from box j > 4 are never moved and the game halts 
after 3Af steps, which implies that the event that 

lim X t (0) > 3M and X (j) < M for j = 1,2,3 

t— >oo 

is the empty event. This and Lemmas 4.3 and 4.4 allow to conclude that 

P e (v e (G) < (1 - l2e)N) < P e {v c {G) <N- MI) < P, (lim^ X t (0) > 3M) 

< P e (limt-voo X t (0) > 3M and X (j) < M for j = 1, 2, 3) 

+ P t (lim^oo X t (0) > 3M and X (j) > M for some j = 1, 2, 3) 

< P e (0) + P t (Umj^oo X t (0) > 3M and X (j) > M for some j = 1, 2, 3) 

< + 3 cxp(-eTV). 

This completes the proof of Theorem 1.8. 
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